AGGTCGCAGGCGGGCGTGCGTGGAGCGGGGGCCGCGGCCGCGCCGCAGAGATGTGACTCGGGCCGAAGGC

CAGCTGGAGCGTCGGCGCTGCGGGGCCGCGGGGGTCGAATGTTCGTGGCATCAGAGAGAAAGATGAGAGC 

TCACCAGGTGCTCACCTTCCTCCTGCTCTTCGTGATCACCTCGGTGGCCTCTGAAAACGCCAGCACATCC 

CGAGGCTGTGGGCTGGACCTCCTCCCTCAGTACGTGTCCCTGTGCGACCTGGACGCCATCTGGGGCATTG 

TGGTGGAGGCGGTGGCCGGGGCGGGCGCCCTGATCACACTGCTCCTGATGCTCATCCTCCTGGTGCGGCT 

GCCCTTCATCAAGGAGAAGGAGAAGAAGAGCCCTGTGGGCCTCCACTTTCTGTTCCTCCTGGGGACCCTG 

GGCCTCTTTGGGCTGACGTTTGCCTTCATCATCCAGGAGGACGAGACCATCTGCTCTGTCCGCCGCTTCC 

TCTGGGGCGTCCTCTTTGCGCTCTGCTTCTCCTGCCTGCTGAGCCAGGCATGGCGCGTGCGGAGGCTGGT 

GCGGCATGGCACGGGCCCCGCGGGCTGGCAGCTGGTGGGCCTGGCGCTGTGCCTGATGCTGGTGCAAGTC 

ATCATCGCTGTGGAGTGGCTGGTGCTCACCGTGCTGCGTGACACAAGGCCAGCCTGCGCCTACGAGCCCA 

TGGACTTTGTGATGGCCCTCATCTACGACATGGTACTGCTTGTGGTCACCCTGGGGCTGGCCCTCTTCAC 

TCTGTGCGGCAAGTTCAAGAGGTGGAAGCTGAACGGGGCCTTCCTCCTCATCACAGCCTTCCTCTCTGTG 

CTCATCTGGGTGGCCTGGATGACCATGTACCTCTTCGGCAATGTCAAGCTGCAGCAGGGGGATGCCTGGA 

ACGACCCCACCTTGGCCATCACGCTGGCGGCCAGCGGCTGGGTCTTCGTCATCTTCCACGCCATCCCTGA 

GATCCACTGCACCCTTCTGCCAGCCCTGCAGGAGAACACGCCCAACTACTTCGACACGTCGCAGCCCAGG 

ATGCGGGAGACGGCCTTCGAGGAGGACGTGCAGCTGCCGCGGGCCTATATGGAGAACAAGGCCTTCTCCA 

TGGATGAACACAATGCAGCTCTCCGAACAGCAGGATTTCCCAACGGCAGCTTGGGAAAAAGACCCAGTGG 

CAGCTTGGGGAAAAGACCCAGCGCTCCGTTTAGAAGCAACGTGTATCAGCCAACTGAGATGGCCGTCGTG 

CTCAACGGTGGGACCATCCCAACTGCTCCGCCAAGTCACACAGGAAGACACCTTTGGTGAAAGACTTTAA 

GTTCCAGAGAATCAGAATTTCTCTTACCGATTTGCCTCCCTGGCTGTGTCTTTCTTGAGGGAGAAATCGG 

TAACAGTTGCCGAACCAGGCCGCCTCACAGCCAGGAAATTTGGAAATCCTAGCCAAGGGGATTTCGTGTA 

AATGTGAACACTGACGAACTGAAAAGCTAACACCGACTGCCCGCCCCTCCCCTGCCACACACACAGACAC 

GTAATACCAGACCAACCTCAATCCCCGCAAACTAAAGCAAAGCTAATTGCAAATAGTATTAGGCTCACTG 

GAAAATGTGGCTGGGAAGACTGTTTCATCCTCTGGGGGTAGAACAGAACCAAATTCACAGCTGGTGGGCC 

AGACTGGTGTTGGTTGGAGGTGGGGGGCTCCCACTCTTATCACCTCTCCCCAGCAAGTGCTGGACCCCAG 

GTAGCCTCTTGGAGATGACCGTTGCGTTGAGGACAAATGGGGACTTTGCCACCGGCTTGCCTGGTGGTTT 

GCACATTTCAGGGGGGTCAGGAGAGTTAAGGAGGTTGTGGGTGGGATTCCAAGGTGAGGCCCAACTGAAT 

CGTGGGGTGAGCTTTATAGCCAGTAGAGGTGGAGGGACCCTGGCATGTGCCAAAGAAGAGGCCCTCTGGG 

TGATGAAGTGACCATCACATTTGGAAAGTGATCAACCACTGTTCCTTCTATGGGGCTCTTGCTCTAATGT 

CTATGGTGAGAACACAGGCCCCGCCCCTTCCCTTGTAGAGCCATAGAAATATTCTGGCTTGGGGCAGCAG 

TCCCTTCTTCCCTTGATCATCTCGCCCTGTTCCTACACTTACGGGTGTATCTCCAAATCCTCTCCCAATT 

TTATTCCCTTATTCATTTCAAGAGCTCCAATGGGGTCTCCAGCTGAAAGCCCCTCCGGGAGGCAGGTTGG 

AAGGCAGGCACCACGGCAGGTTTTCCGCGATGATGTCACCTAGCAGGGCTTCAGGGGTTCCCACTAGGAT 

GCAGAGATGACCTCTCGCTGCCTCACAAGCAGTGACACCTCGGGTCCTTTCCGTTGCTATGGTGAAAATT 

CCTGGATGGAATGGATCACATGAGGGTTTCTTGTTGCTTTTGGAGGGTGTGGGGGATATTTTGTTTTGGT 

TTTTCTGCAGGTTCCATGAAAACAGCCCTTTTCCAAGCCCATTGTTTCTGTCATGGTTTCCATCTGTCCT 

GAGCAAGTCATTCCTTTGTTATTTAGCATTTCGAACATCTCGGCCATTCAAAGCCCCCATGTTCTCTGCA 

CTGTTTGGCCAGCATAACCTCTAGCATCGATTCAAAGCAGAGTTTTAACCTGACGGCATGGAATGTATAA 

ATGAGGGTGGGTCCTTCTGCAGATACTCTAATCACTACATTGCTTTTTCTATAAAACTACCCATAAGCCT 

TTAACCTTTAAAGAAAAATGAAAAAGGTTAGTGTTTGGGGGCCGGGGGAGGACTGACCGCTTCATAAGCC 

AGTACGTCTGAGCTGAGTATGTTTCAATAAACCTTTTGATATTTCTCAAAAAAAAAAAAAAAAAAAAAAA 

(SEQ ID NO:1)
MFVASERKMRAHQVLTFLLLFVITSVASENASTSRGCGLDLLPQYVSLCDLDAIWGIVVEAVAG

AGALITLLLMLILLVRLPFIKEKEKKSPVGLHFLFLLGTLGLFGLTFAFIIQEDETICSVRRFL

WGVLFALCFSCLLSQAWRVRRLVRHGTGPAGWQLVGLALCLMLVQVIIAVEWLVLTVLRDTRPA

CAYEPMDFVMALIYDMVLLVVTLGLALFTLCGKFKRWKLNGAFLLITAFLSVLIWVAWMTMYLF

GNVKLQQGDAWNDPTLAITLAASGWVFVIFHAIPEIHCTLLPALQENTPNYFDTSQPRMRETAF

EEDVQLPRAYMENKAFSMDEHNAALRTAGFPNGSLGKRPSGSLGKRPSAPFRSNVYQPTEMAVV

LNGGTIPTAPPSHTGRHLW (SEQ ID NO:2)
underlined = deleted in targeting construct

bold = sequence flanking Neo insert in targeting construct 



AGGTCGCAGGCGGGCGTGCGTGGAGCGGGGGCCGCGGCCGCGCCGCAGAGATGTGACTCG 
GGCCGAAGGCCAGCTGGAGCGTCGGCGCTGCGGGGCCGCGGGGGTCGAATGTTCGTGGCA
TCAGAGAGAAAGATGAGAGCTCACCAGGTGCTCACCTTCCTCCTGCTCTTCGTGATCACC 
TCGGTGGCCTCTGAAAACGCCAGCACATCCCGAGGCTGTGGGCTGGACCTCCTCCCTCAG 
TACGTGTCCCTGTGCGACCTGGACGCCATCTGGGGCATTGTGGTGGAGGCGGTGGCCGGG 
GCGGGCGCCCTGATCACACTGCTCCTGATGCTCATCCTCCTGGTGCGGCTGCCCTTCATC 
AAGGAGAAGGAGAAGAAGAGCCCTGTGGGCCTCCACTTTCTGTTCCTCCTGGGGACCCTG 
GGCCTCTTTGGGCTGACGTTTGCCTTCATCATCCAGGAGGACGAGACCATCTGCTCTGTC
CGCCGCTTCCTCTGGGGCGTCCTCTTTGCGCTCTGCTTCTCCTGCCTGCTGAGCCAGGCA
TGGCGCGTGCGGAGGCTGGTGCGGCATGGCACGGGCCCCGCGGGCTGGCAGCTGGTGGGC
CTGGCGCTGTGCCTGATGCTGGTGCAAGTCATCATCGCTGTGGAGTGGCTGGTGCTCACC 
GTGCTGCGTGACACAAGGCCAGCCTGCGCCTACGAGCCCATGGACTTTGTGATGGCCCTC 
ATCTACGACATGGTACTGCTTGTGGTCACCCTGGGGCTGGCCCTCTTCACTCTGTGCGGC 
AAGTTCAAGAGGTGGAAGCTGAACGGGGCCTTCCTCCTCATCACAGCCTTCCTCTCTGTG 
CTCATCTGGGTGGCCTGGATGACCATGTACCTCTTCGGCAATGTCAAGCTGCAGCAGGGG 
GATGCCTGGAACGACCCCACCTTGGCCATCACGCTGGCGGCCAGCGGCTGGGTCTTCGTC 
ATCTTCCACGCCATCCCTGAGATCCACTGCACCCTTCTGCCAGCCCTGCAGGAGAACACG 
CCCAACTACTTCGACACGTCGCAGCCCAGGATGCGGGAGACGGCCTTCGAGGAGGACGTG 
CAGCTGCCGCGGGCCTATATGGAGAACAAGGCCTTCTCCATGGATGAACACAATGCAGCT 
CTCCGAACAGCAGGATTTCCCAACGGCAGCTTGGGAAAAAGACCCAGTGGCAGCTTGGGG 
AAAAGACCCAGCGCTCCGTTTAGAAGCAACGTGTATCAGCCAACTGAGATGGCCGTCGTG 
CTCAACGGTGGGACCATCCCAACTGCTCCGCCAAGTCACACAGGAAGACACCTTTGGTGA 
AAGACTTTAAGTTCCAGAGAATCAGAATTTCTCTTACCGATTTGCCTCCCTGGCTGTGTC 
TTTCTTGAGGGAGAAATCGGTAACAGTTGCCGAACCAGGCCGCCTCACAGCCAGGAAATT 
TGGAAATCCTAGCCAAGGGGATTTCGTGTAAATGTGAACACTGACGAACTGAAAAGCTAA 
CACCGACTGCCCGCCCCTCCCCTGCCACACACACAGACACGTAATACCAGACCAACCTCA 
ATCCCCGCAAACTAAAGCAAAGCTAATTGCAAATAGTATTAGGCTCACTGGAAAATGTGG
CTGGGAAGACTGTTTCATCCTCTGGGGGTAGAACAGAACCAAATTCACAGCTGGTGGGCC 
AGACTGGTGTTGGTTGGAGGTGGGGGGCTCCCACTCTTATCACCTCTCCCCAGCAAGTGC 
TGGACCCCAGGTAGCCTCTTGGAGATGACCGTTGCGTTGAGGACAAATGGGGACTTTGCC 
ACCGGCTTGCCTGGTGGTTTGCACATTTCAGGGGGGTCAGGAGAGTTAAGGAGGTTGTGG 
GTGGGATTCCAAGGTGAGGCCCAACTGAATCGTGGGGTGAGCTTTATAGCCAGTAGAGGT 
GGAGGGACCCTGGCATGTGCCAAAGAAGAGGCCCTCTGGGTGATGAAGTGACCATCACAT 
TTGGAAAGTGATCAACCACTGTTCCTTCTATGGGGCTCTTGCTCTAATGTCTATGGTGAG 
AACACAGGCCCCGCCCCTTCCCTTGTAGAGCCATAGAAATATTCTGGCTTGGGGCAGCAG 
TCCCTTCTTCCCTTGATCATCTCGCCCTGTTCCTACACTTACGGGTGTATCTCCAAATCC 
TCTCCCAATTTTATTCCCTTATTCATTTCAAGAGCTCCAATGGGGTCTCCAGCTGAAAGC 
CCCTCCGGGAGGCAGGTTGGAAGGCAGGCACCACGGCAGGTTTTCCGCGATGATGTCACC
TAGCAGGGCTTCAGGGGTTCCCACTAGGATGCAGAGATGACCTCTCGCTGCCTCACAAGC
AGTGACACCTCGGGTCCTTTCCGTTGCTATGGTGAAAATTCCTGGATGGAATGGATCACA 
TGAGGGTTTCTTGTTGCTTTTGGAGGGTGTGGGGGATATTTTGTTTTGGTTTTTCTGCAG 
GTTCCATGAAAACAGCCCTTTTCCAAGCCCATTGTTTCTGTCATGGTTTCCATCTGTCCT 
GAGCAAGTCATTCCTTTGTTATTTAGCATTTCGAACATCTCGGCCATTCAAAGCCCCCAT 
GTTCTCTGCACTGTTTGGCCAGCATAACCTCTAGCATCGATTCAAAGCAGAGTTTTAACC
TGACGGCATGGAATGTATAAATGAGGGTGGGTCCTTCTGCAGATACTCTAATCACTACAT 
TGCTTTTTCTATAAAACTACCCATAAGCCTTTAACCTTTAAAGAAAAATGAAAAAGGTTA 
GTGTTTGGGGGCCGGGGGAGGACTGACCGCTTCATAAGCCAGTACGTCTGAGCTGAGTAT 
GTTTCAATAAACCTTTTGATATTTCTCAAAAAAAAAAAAAAAAAAAAAAA 
Gene Sequence Structure*
526 bp
Sequence Deleted
594 bp
Size of full-length cDNA: 2870 bp
Targeting Vector* (genomic sequence)



Arm Length: 
5': 4.2 kb 
3': 0.7 kb 



— Targeting Vector 
- Endogenous Locus 

* Not drawn to scale

5' arm | LacZ-Neo Cassette | 3' arm

5'>TGATGCTCATTCTCCTAGTGA
GACTACCCTTCATCAAGGACAAGG
AAAGGAAGCGGCCTGTGTGCCTCC
ATTTCCTCTTCCTGCTGGGGACCC
TGGGCCTCTTTGGCCTGACGTTTG
CCTTCATCATCCAGATGGACGAGA
CAATCTGCTCCATCCGACGCTTCC
TCTGGGGTGTCCTCTTCGCGCTCT
GCTTTTCCGCT<3' (SEQ ID NO:3)

5'>GTGAGCCTGGCACTGTGCCTG
ATGCTGGTGCAGGTCATCATTGCC
ACTGAGTGGCTGGTGCTGACTGTG
CTGCGTGACACGAAGCCAGCCTGC
GCCTACGAGCCCATGGATTTTGTG
ATGGCGCTCATCTACGACATGGTG
CTGCTGGCCATCACCCTGGCCCAG
TCCCTCTTCACGCTGTGTGGCAAG
TTCAAACGGTG<3' (SEQ ID NO:4)